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The leader RNA of the 5’ untranslated region (UTR) of coronaviral genomes 
contains two stem-loop structures denoted SL1 and SL2. Herein, we show 
that SL1 is functionally and structurally bipartite. While the upper region of 
SL1 is required to be paired, we observe strong genetic selection against 
viruses that contain a deletion of A35, an extrahelical nucleotide that 
destabilizes SL1, in favor of genomes that contain a diverse panel of desta- 
bilizing second-site mutations, due to introduction of a noncanonical base 
pair near A35. Viruses containing destabilizing SL1-AA35 mutations also 
contain one of two specific mutations in the 3’ UTR. Thermal denaturation 
and imino proton solvent exchange experiments reveal that the lower half of 
SL1 is unstable and that second-site SL1-A.A35 substitutions are character- 
ized by one or more features of the wild-type SL1. We propose a “dynamic 
SL1” model, in which the base of SL1 has an optimized lability required to 
mediate a physical interaction between the 5’ UTR and the 3’ UTR that 
stimulates subgenomic RNA synthesis. Although not conserved at the 
nucleotide sequence level, these general structural characteristics of SL1 
appear to be conserved in other coronaviral genomes. 

© 2008 Elsevier Ltd. All rights reserved. 
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Introduction 


on serologic relatedness, genetic organization, and 
sequence similarity. Mouse hepatitis virus (MHV) 
is the most extensively studied CoV, is genetically 


Coronaviruses (CoVs) comprise a large group of 
enveloped, single-stranded, positive-sense mRNA 
viruses classified in the family Coronaviridae of the 
order Nidovirales.' CoVs infect many different verte- 
brate hosts and induce a broad spectrum of diseases, 
including severe acute respiratory syndrome (SARS).”” 
CoVs have been classified into three subgroups based 
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Abbreviations used: UTR, untranslated region; CoV, 
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MHV, mouse hepatitis virus; g3NA, genomic RNA; 
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subgenomic RNA; WT, wild type; RT, reverse 
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tractable, and provides small animal models for se- 
veral diseases including SARS," a disease caused by 
the related group 2 SARS CoV. 

Like other CoVs, MHV has a large ~32-kb positive- 
sense viral RNA genome flanked by 5’ and 3’ un- 
translated regions (UTRs). The 5’ two-thirds of the 
genome encodes the replicase complex (open read- 
ing frame 1), whose expression requires a —1 la/1b 
translational frameshift,” with downstream open 
reading frames encoding a variety of structural and 
accessory proteins required for replication and 
assembly of infectious viruses.’”” MHV transcription 
generates a set of nested subgenomic mRNAs, 
which contain 5’ and 3’ regions that are identical 
with those of the genomic RNA (gRNA). 7° The 
most widely accepted model used to explain this 
unique discontinuous transcription mechanism 
posits that subgenomic negative-strand RNAs are 
synthesized from genome-sized template, and that 
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the subgenomic mRNAs are synthesized from sub- 
genomic minus strands.'°'* Zuniga et al. have 
refined this model by providing strong support for 
the hypothesis that heptameric core intergenic se- 
quences or “body” transcription regulatory se- 
quences (TRS-Bs) signal template switching to the 
TRS 5’ leader (TRS-L), as well as suggest the exis- 
tence of RNA-RNA interactions between the 5’ UTR 
and 3’ UTRs and the proteins bound to these UTRs."* 

Many cis-acting sequences in the 5’ UTR have 
been shown to play an important role in RNA trans- 
cription and replication;'*-!® however, insights into 
the secondary structure of the 5’ UTR were, until 
recently, limited. We have developed a consensus 
secondary structure model based on the sequences 
of nine CoVs, in which the 5’ leader RNA, consisting 
of the 5’ most 72 nucleotides of the 5’ UTR in MHV, 
is characterized by two conserved stem-loop struc- 
tures, denoted SL1 and SL2.'”?° We have shown that 
SARS-CoVs SL1 and SL2 can replace their MHV 
counterparts in an otherwise MHV gRNA; these 
chimeric viruses are transcriptionally active and re- 
plication competent.'” This experiment strongly 
supports the predicted secondary structural archi- 
tecture of the 5’ UTR in MHV. SL2 is a highly 
conserved pentaloop containing structural features 
consistent with a U-turn architecture that plays a 
critical role in subgenomic RNA (sgRNA) synthesis, 
as well as in translation. 

In this report, we extensively characterize the 
functional and structural properties of SL1. We show 
that SL1 adopts a bipartite structure with the func- 
tional boundary of two consecutive pyrimidine— 
pyrimidine (pyr—pyr) base pairs in the middle of the 
SL1 helix.*° The upper region of the stem must be 
base paired to support viral replication to near-wild- 
type (WT) levels. In striking contrast, characteriza- 
tion of mutant viruses in which an unpaired 
nucleotide, A35, is deleted leads to the recovery of 
a diverse collection of mutant viruses that contain 
single-nucleotide second-site mutations in the lower 
half of SL1, all of which are kinetically and thermo- 
dynamically destabilizing. In addition, these viruses 
contain one of two single-nucleotide substitutions in 
the extreme 3’ UTR of the genome. Nonviable mu- 
tants in this region of SL1, like SL2 mutants 
characterized previously,”” fail to support sgRNA 
synthesis. These data, taken collectively, suggest 
that the base of SL1 must adopt an optimized sta- 
bility or kinetic lability to mediate a long-range 
interaction between the 5’ UTR and the 3’ UTR that 
is an obligate step in the transcription of sgRNAs. 
The nature of this interaction is discussed. 


Results 


SL1 folds in solution and adopts a functionally 
bipartite structure 


A series of NMR studies of SL1-containing RNA 
constructs from both HCoV-OC43 and MHV docu- 


(a) (b) (c) 
21gU Ac AC AC 
206 Ges a Ges U Gs 
19C U 2G U aan 
5 acta/aceo nA 16 
G:C 7G: aan 
"C6.-6 ACIB6==6G-6 ee 
G Cs wes G+ G30 
e 1 
Pye ge 2022 ay ete peetes 
QB a: Bai A: US ecais 
"G ; hoch "O ; Koch’ ea “CH asa 
G: C37 G:C37 A10 &! O37 *ayse 
oon ae, A-U . AA35/A36U 
Ri 0 awe” 5e Gao U37 
UAU 9 A 
5 5 «4 
WI WT* AA35 


Fig. 1. Predicted secondary structure model of MHV- 
A59 SL1 constructs discussed here. (a) WT; (b) WT*, 
which corresponds to SL1-A(C16/C19/C20) with a 5’-GA 
overhang required for initiation of in vitro transcription by 
SP6 polymerase; and (c) second-site revertant SL1s de- 
rived from infection with the SL1-AA35 virus (AA35 RNA 
sequence shown; recovered single-nucleotide substitu- 
tions highlighted in red). The constructs labeled WT*, 
AA35, AA35/U33C, AA35/C34U, and AA35/A36U 
RNAs (b and c) were used for the NMR and thermal un- 
folding studies. The 5’ g is a non-native nucleotide, shown 
in lower case. 


ment the key predicted structural features of SL1 in 
our model,2° the most notable of which are two 
consecutive pyr-pyr base pairs, U12eC32 and 
U13eU31, in the middle of the helix, flanked on 
either side by Watson—Crick A11-U33 and G14-C30 
base pairs (Fig. 1a). In addition, these NMR studies 
also suggested that either A35 or A36 is an unpaired 
extrahelical nucleotide and that the alternative 
pairing of U9 with A35 or A36 introduces a local 
destabilization in this region of SL1 stem.”° Inter- 
estingly, the predicted SL1s in other coronaviral 
genomes also appear to conserve this general feature 
of a rather weakly paired region at the base or 
middle of SL1, containing either or both noncano- 
nical base pairs and extrahelical nucleotides, sug- 
gesting that this might be important for the repli- 
cation of MHV and perhaps other CoVs. This served 
as the motivation for the construction of a mutant 
MHV genome harboring a single-nucleotide dele- 
tion of A35 (SL1-AA35)f. 


Mutational analysis of MHV SL1 base pairing 


Since NMR data supported the existence of SL1 in 
5’-leader-containing RNA fragments,”” we first 
investigated the functional importance of SL1 in 
viral replication. One set of mutations targeted the 


f{ Although A35 or A36 (or both, one at a time) could 
theoretically be extruded from the SL1 helix, extrusion of 
either one gives rise to the same structure of paired bases 
in the SL1 stem. In this work, we characterized SL1-AA35 
MHV mutants, but SL1-AA36 viruses would give rise to 
exactly the same sequence in SL1 and thus would not be 
functionally distinguishable from the SL1-AA35 virus. 
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Table 1. Effect of mutations in SL1 on plaque production 


Virus name Mutation recovered Intended effect of mutation on SL1 Plaque diameter (mm)* 


SL1 stem mutants 


MHV-A59 1000 WT 2.37+0.05 
SL1-A G14C/G15U/G17C Destroys base pairing in the upper region of the stem Not viable 
SL1-B C28G/C29A/C30G Destroys base pairing in the upper region of the stem 1.82+0.04 
SLI-AB C28G/C29A/C30G Maintains base pairing in the upper region of the stem 2.16+0.04 
G14C/G15U/G17C 
SL1-C G6C/A7U/G8C Destroys base pairing in the lower region of the stem 1.55+0.03 
SL1-D C37G/U38A/C39G Destroys base pairing in the lower region of the stem 1.77+0.05 
SL1-CD G6C/A7U/G8C Maintains base pairing in the lower region of the stem Not viable 
C37G/U38A/C39G 
Cytidine deletion mutants 
MHV-A59 1000 WT 2.14+0.06 
SL1-AC16 AC16 Removes a bulged C from the upper region of the stem 2.03+0.05 
SL1-AC19/C20 AC19/C20 Removes two bulged Cs from the upper region of the stem 2.08+0.09 
SL1-AC16/C19/C20 AC16/C19/C20 Removes all bulges from the upper region of the stem 1.87+0.06 
SL1-AA35 mutant and second-site suppressor mutants 
MHV-A59 1000 WT 2.74+0.06 
SL1-AA35-PL6-ps1 AA35/3/A29G Removed bulged A from the lower region of the stem 2.47 +£0.09 
SL1-AA35-PL6-ps2 AA35/G10A/3'A29G Removed bulged A from the lower region of the stem 2.53 £0.08 
SL1-AA35-PL12-ps2.  AA35/C34U/3’A78G Removed bulged A from the lower region of the stem 2.13+0.08 
SL1-AA35-PL4-ps1 AA35/U33C/3/A29G Removed bulged A from the lower region of the stem 2.57+0.09 
SL1-AA35-PL1-ps1 AA35/A36U/3'A29G Removed bulged A from the lower region of the stem 2.46+0.07 
SL1-AA35-PL3-ps1 AA35/C37U/3/A29G Removed bulged A from the lower region of the stem 2.80+0.10 
SL1-AA35-PL14-psl_ = AA35/C37U/3'A78G Removed bulged A from the lower region of the stem 2.54+0.07 
Mutants that target the lower SL1 stem 
MHV-A59 1000 WT 2.2+0.05 
SL1-A5G/U40C A5G/U40C Increased stability of lower stem Not viable 
SL1-U40C U40C Control for A5G/U40C 2.1+0.06 
SL1-A5G A5G Control for A5G/U40C 1.8+0.06 
SL1-A5C/U40C A5C/U40C Control for A5G/U40C 2.1+0.06 
SL1-A7G/U38C WT Increased stability of the lower region of the stem Unstable 


Data are from four separate experiments. A WT MHV-A59 is included for comparison with each experiment. 
* Plaque diameters were calculated as an average of at least 50 plaques, as described previously.*! 


upper portion of SL1 above the pyr—pyr base pairs, 
while a second set targeted the lower region of SL1 
(Table 1). Plaque size phenotypes (Table 1, Supple- 
mentary Fig. 1) and one-step growth curves (Fig. 2, 
Supplementary Fig. 2) were measured for all viable 
viruses. Deletion of each of the three cytidines uni- 
que to MHV (C16, C19, and C20, with C16 predicted 
to be extrahelical), either singly or altogether, gave 


rise to viable viruses with WT-like growth char- 
acteristics (Supplementary Fig. 2a). Thus, the C resi- 
dues are dispensable for viral replication, justifying 
our use of the WT model (denoted WT*) in physical 
studies presented below (see Figs. 1b, 5-7). Deletion 
of these cytidines results in three consecutive G-C 
base pairs in the upper segment of the stem-loop 
(Fig. 1b). 
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Fig. 2. One-step growth curves 
of viable mutant and WT (MHV- 
A59 1000) viruses. (a) Viruses con- 
taining mutations in the upper 
(SL1-B and SL1-AB) and lower 
(SL1-C and SL1-D) portions of SL1 
(see Table 1 for sequences). (b) Vi- 
ruses recovered from infection with 
SL1-AA35 viruses. 
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Mutations that disrupt the three consecutive G-C 
base pairs in the WT genome either were nonviable 
(SL1-A) or resulted in a virus with a moderately 
smaller plaque size (SL1-B). Computer-assisted mo- 
deling with Mfold and ViennaRNA (see Kang et 
al.'*) indicates that the SL1-B mutant may adapt an 
alternative stable folding that largely maintains the 
SL1 structure, whereas the SL1-A mutant does not 
(data not shown). In contrast, the compensatory 
mutant SL1-AB—containing both the SL1-A and the 
SL1-B mutations and predicted to restore base 
pairing—possessed a similar plaque size and had 
almost identical growth kinetics as the WT virus 
(Table 1, Fig. 2a, Supplementary Fig. 1). These data 
reveal that the upper portion of SL1 must be base 
paired in order to support efficient virus replication. 

In contrast, the mutations introduced below the 
two consecutive pyr—-pyr base pairs gave rise to 
completely opposite results. Two viral genomes 
carrying mutations that destroy base pairing in 
this region (SL1-C and SL1-D) were found to be 
viable, albeit with significantly smaller plaque sizes 
and slower one-step growth curves than the WT 
virus (Table 1, Fig. 2a, Supplementary Fig. 1). 
Surprisingly, the corresponding double transversion 
mutant predicted to restore base pairing here (SL- 
CD) was not viable (Table 1). 


Deletion A35 gives rise to second-site 
suppressor mutations in both SL1 and 3’ UTR 


To further investigate the functional importance of 
the lower SL1 region, we characterized mutant 
viruses containing a deletion of A35 (SL1-AA35). 
From two completely independent transfections, we 
recovered 12 plaque-purified viruses. Sequencing of 
the complete 5’ and 3’ UTRs of the 12 plaque- 
purified viruses revealed that only 2 of these viruses 
contained the SL1-AA35 sequence; the remaining 10 
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viruses made up a diverse collection of single- 
nucleotide second-site substitutions in the immedi- 
ate vicinity of the deleted nucleotide. Several of 
these plaque isolates contained identical sequences 
in their 5’ and 3’ UTRs. Viruses with identical se- 
quences had similar plaque sizes. The plaque pheno- 
types of one example of each second-site mutation 
are presented in Table 1. These mutations include 
C37U, A36U, C34U, and U33C (from the base of 
SL1 toward the hairpin loop; see Fig. 1c) in the 
context of the parent SL1-AA35 deletion. Addition- 
ally, viruses containing the SL1-AA35 sequence 
were completely replaced by viruses containing a 
second-site mutation (G10A) in SL1 after only one 
passage in DBT cells, confirming that viruses con- 
taining only a deletion of A35 were genetically un- 
stable and were subjected to strong negative selec- 
tion. Remarkably, although diverse in sequence, all 
second-site mutants possess a common property: 
each is predicted to change a Watson—Crick base 
pair in the WT SL1 to a noncanonical base pair 
(vide infra). 

In addition to these substitutions in the 5’ UTR, all 
recovered viruses contained one of two second-site 
single-nucleotide substitutions in the 3’ UTR, 3/- 
A29G or 3’-A78G [3’, counting from the 3’ nucleo- 
tide of the 3’ UTR in the genomic strand, before the 
poly(A) tail], near the very 3’ end of the genome 
some 30 kb distant. In one case (SL1-AA35/C34U), 
the passage 1 virus contained a mixture of the 3’- 
A78G and WT sequences. The WT sequence was lost 
after one additional passage (Fig. 4a). Both A29G 
and A78G mutations in the 3’ UTR share the same 
sequence context, GAGG, relative to the same WT 
context, GAAG. In addition, in at least one case 
(C37U), both A29G and A78G substitutions were 
recovered; this is consistent with the idea that each 
mutation, which coevolves with the 5’ UTR muta- 
tion, is functionally equivalent. These data argue for 


O SL1- AA35/C37U/3'A29G 
® SL1- AA35/C37U/3'A78G 


0.5 Vv rSL1- AA35/C37U/3'A29G 
4 rSL1- AA35/C37U/3'A78G 
0.0 2 : 
S © & © © 0) 10 20 30 Fig. 3. Phenotypes of recombi- 
av’ KS ae” ae Time post-infection (h) nant SL1-AA35 second-site sup- 
as’ as’ as’ as’ pressor viruses. (r) indicates recom- 
L Res Le Rs binant virus versus those recovered 
¥ A ¥ yr from infection with SL1-A.A35 vir- 
oy oy oy so ion genomes. (a) Plaque size (mm). 
& & 
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a physical interaction between 5’ UTR and 3’ UTR in 
MHV and suggest that it is mediated by a “desta- 
bilized” or “dynamic” SL1. 

To confirm that the phenotypes of the second-site 
suppressor viruses that arose after electroporation 
with the SL1-AA35 mutant genome were due to the 
second-site mutations we detected in the 5’ and 3’ 
UTRs, and not due to other mutations in regions of 
the genome that were not sequenced, we examined 
two mutants, SL1-AA35/C37U/3’A29G and SL1- 
AA35/C37U/3'A78G, in more detail. We cons- 
tructed genomes containing these mutations using 
our standard reverse genetic system and generated 
recombinant viruses (designated rSL1-AA35/C37U/ 
3'A29G and rSL1-AA35/C37U/3'A78G). The se- 
quence of the 5’ and 3’ UTRs of these viruses was 
verified, and their plaque morphologies and growth 
phenotypes were compared with the corresponding 
SL1-AA35 second-site suppressor mutants. Each of 
the six plaque isolates of the recombinant mutant 
viruses rSL1-AA35/C37U/3’A29G and rSL1-AA35/ 
C37U/3/A78G contained no additional mutations in 
their 5’ and 3’ UTRs, and their plaque sizes and 
morphologies are very similar to those of the ori- 
ginally recovered mutants SL1-AA35/C37U/3’ 
A29G and SL1-AA35/C37U/3'A78G (Fig. 3a, Sup- 
plementary Fig. 3). These data strongly suggest that 


(a) (b) 
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the phenotypes of the originally recovered SL1- 
AA35 second-site revertants are indeed due to the 
mutations in their 5’ and 3’ UTRs, rather than due to 
other uncharacterized mutations elsewhere in the 
genome. 


Mutations that increase the stability of the lower 
region of SL1 are not viable 


Genomes containing A-U to G-C base-pair sub- 
stitutions that are predicted to stabilize the lower 
region of SL1 are either nonviable (SL1-A5G/U40C) 
or unstable (SL1-A7G/U38C). In the case of SL1- 
A5G/U40C, single- and double-nucleotide substitu- 
tions that destroy the original base pair (SL1-A5G, 
SL1-U40C, or SL1-A5C/U40C) are all viable and 
characterized by near-WT-like plaque sizes and 
growth curve characteristics ([able 1, Supplemen- 
tary Figs. 1 and 2b). Strikingly, the SL1-A7G/U38C 
virus is a true revertant, with the A7G substitution 
repaired in the first passage and with the U38C 
substitution repaired next to generate the original 
A7-U38 base pair (Fig. 4b). These data, taken 
collectively, suggest that the lower portion of SL1 
is weakly base paired or not at all (vide infra), and 
that this region of the stem plays a critical role in 
viral replication. 


SL1-A7G/U38C (Transcribed Genome) 


\/ 
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Fig. 4. Sequence scans of se- 
lected mutants. (a) Selection of the 
mutation at position 3’A29G in the 
SL1-AA35/C34U virus. (b) True re- 
version of the introduced mutant 
G7-C38 base pair to the WT A7-U38 
base pair. The scans shown corre- 
spond to negative-sense sequences. 
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Mutations in SL1 influence RNA synthesis 


To determine whether the altered growth pheno- 
types of MHV SL1 mutant viruses correlated with 
defects in genome replication or sgRNA synthesis, 
virus-specific RNAs were metabolically radiola- 
beled and analyzed by gel electrophoresis. Cells 
infected with MHV SL1 mutant viruses synthesized 
both gRNA and the seven sgRNAs in approximately 
the same ratios as observed for a WT virus infection 
(Supplementary Fig. 4). SL1-B, SL1-C, SL1-D, SL1- 
AC16/19/20, and SL1-AA35/C34U/3'A78G 
viruses appeared to synthesize the lowest absolute 
quantities of gRNAs and sgRNAs (<32% of WT), 
while the other SL1 mutant viruses directed the 
synthesis of virus-specific RNAs to levels greater 
than 55% of those observed with WT virus. One 
nonviable mutant, SL1-A, was also assayed for the 
ability to synthesize sg RNA 7 and gRNAs by a series 
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of reverse transcription (RT) PCR assays of RNA 
extracted from electroporated cells. Previous char- 
acterization of nonviable SL2 mutants revealed that 
the primary defect in these genomes was that they 
failed to support the transcription of oSRNAs, 
whereas replication of gRNA was intact,”’ with the 
latter evidenced by a negative-sense gRNA replica- 
tion intermediate. The same is true of the SL1-A 
mutant (Fig. 5). A comparison of the in vitro trans- 
lation efficiencies of fusions of WT and several SL1 
mutant 5’ UTRs that gave rise to nonviable genomes 
to luciferase reporters found no effect on transla- 
tional efficiency (data not shown). 


A low thermostability of SL1 is crucial for virus 
stability and viability 


The functional data presented above are consis- 
tent with what we term a “dynamic SL1” hypothe- 


Kev (+) ei (-) input fg 
8 24 8 24 8 24h 


=m 559 bp 


MHV-A59 1000 


(-) (+) 
8 24 8 24h 


mm 330 bp 


MHV-A59 1000 


GAPDH 
MHV-A59 1000 


Fig. 5. Analysis of MHV-specific RNA synthesis. In all cases, BHK-R cells were electroporated with in vitro transcripts 
corresponding to the genomes of MHV-A59 1000 (WT), SL1-A, or a A59/nsp12-FS frameshift mutant incapable of 
directing the synthesis of viral RNAs. Total RNAs were extracted at the times indicated (4 hpe, 8 hpe, and 12 hpe) and 
analyzed by RI-PCR (see Materials and Methods) for (a) negative-sense anti-gRNA; (b) negative- and positive-sense 
sgRNA 7; or (c) gRNA. Note that total RNA is used as the template for RT in (a) and (b); poly(A) * RNA was used as the 
template in (c), and duplicate samples were analyzed. The arrows indicate the position of the amplicons expected for each 


RNA species. GAPDH, RNA recovery control. 


796 


An Interaction Between the 5’ UTR and the 3’ UTR in MHV 


sis, which posits that the lower region of SL1 must 
be thermodynamically destabilized and/or dyna- 
mically (kinetically) labile in a way that is dependent 
only on the general physical features of this region of 
the SL1, rather than on the precise nucleotide se- 
quence, in order to fully support virus replication. In 
order to test this hypothesis and to gain additional 
insight into SL1 structure, we synthesized three re- 
presentative second-site SL1 revertant mutant RNAs 
(see Fig. 1c), measured their thermodynamic stabi- 
lities using quantitative optically monitored thermal 
denaturation methods (Fig. 6), and measured their 
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dA/dT 


AA35/U33C 
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Fig. 6. Comparison of the thermal unfolding of the 
WT*, AA35, AA35/U33C, AA35/C34U, and AA35/A36U 
SL1 RNAs. The experimental optical melting profiles 
show every fifth data point collected at 260 nm (@) and 
280 nm (O), with the calculated fits (dashed lines) shown. 
For the AA35 RNA, the nonlinear least-squares simulta- 
neous composite fit to a single transition unfolding model, 
and the transition is shown in solid line. For the WT* 
model and three recovered SL1 mutant AA35/U33C, 
AA35/C34U, and AA35/A36U RNAs, a nonlinear least- 
squares simultaneous composite fit to a two-transition 
unfolding model, and component transitions 1 and 2 
(solid lines) are shown. The thermodynamic parameters 
derived from these fits are compiled in Table 2. 


G14i7 AA35/U33C 


Gio ,U31 


1H ppm 


Fig. 7. Imino proton regions of 1D jump-return echo 
spectra acquired at 10 °C and 10 mM KP; (pH 6.0) for WT* 
(a), AA35 (b), AA35/U33C (c), 4A35/C34U (d), and 
AA35/A36U (e). Imino protons corresponding to non- 
canonical base pairs are shown in bold. Note that some 
spectra (WT*, AA35/U33C, and AA35/C34U) are char- 
acterized by slow conformational heterogeneity at the 
base portion of SL1 (A5-U40, G6-C39, and A7-U38 base 
pairs). 


structural (Fig. 7) and dynamic (Fig. 8) properties by 
NMR spectroscopy. 

Optical melting profiles (dA /dT) are shown for the 
WT*, AA35, 4A35/U33C, AA35/C34U, and AA35/ 
A36U SL1 RNAs in Fig. 6, with the thermodynamic 
parameters described from a quantitative analysis of 
these melts compiled in Table 2. In contrast to all of 
the other RNAs, the optical melting profile of the 
AA35 SL1 RNA is well-modeled by a single two- 
state unfolding transition with a t,, of 80.3 °C and 
with a van’t Hoff enthalpy of unfolding (AHy) of 
118 kcal mol”* (Table 2). Thermodynamic para- 
meters derived from melting curves acquired using 
differential scanning calorimetry are in excellent 
agreement with the optical melts, as well as the 
expected AH estimated from the nearest-neighbor 
model.”* In contrast, melting profiles obtained for 
the WT* RNA, as well as for the three AA35 second- 
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site revertant mutant RNAs, are broader than that 
obtained for the AA35 RNA; they also unfold at 
significantly lower ty. A broadened unfolding tran- 
sition is consistent with a superposition of two se- 
quential two-state unfolding transitions (e.g., the 
lower and the upper regions of the SL1 stem) or 
extreme non-two-state unfolding behavior of the SL1 
stem in a single transition. In both cases, fitting of 
these melting profiles to a single two-state transition 
would result in insufficient van’t Hoff enthalpy of 
unfolding (AHy;;) to account for the unfolding of the 
entire 14-base-pair stem (Fig. 6). This is exactly what 
we observe, with the apparent AH, of 70-75 kcal 
mol ' in each case (fits not shown). Invoking a 
sequential two-state unfolding transition model 
reveals two unfolding steps with different ampli- 
tudes. If we assign the first unfolding step to the 
denaturation of the lower four base pairs in SL1 
(predicted AHy;=38 kcal mol ?; t=61 °C),”* then 
we recover sufficient AH, in the second (major) 
transition (70-86 kcal mol ') to account for unfold- 
ing of the entire molecule. From these fits, we calcu- 
late that the WT RNA is destabilized by 3.0 kcal 
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Fig. 8. Graphical representation 
of the imino proton solvent ex- 
change rates (k,,) for the SL1 WT* 
and mutants. (a) Secondary struc- 
ture of the WT* RNA. (b) k,, is 
plotted versus base-pair position 
(from the bottom to the top of the 
SL1 helix, from left to right). The 
mutations are shaded in red. k,, 
could not be unambiguously mea- 
sured for the G8 for the middle 
three RNAs due to spectral overlap 
with G15; the same is true of G14 
and G17 imino protons in all spec- 
tra. In these cases, the average k,, is 
plotted. 


mol ! relative to the AA35 mutant, while the three 
second-site revertants are destabilized by 2.1 kcal 
mol | (AA35/U33C), 4.5 kcal mol * (AA35/C34U), 
and 4.7 kcal mol ' (AA35/A36U) . These data reveal 
that each of the recovered revertant RNAs shares a 
common physical property with the WT RNA (ie., 
all are thermodynamically destabilized relative to 
the AA35 mutant). 


Mutant SL1 RNAs experience enhanced 
dynamics and conformational heterogeneity 


Since thermodynamic destabilization is a global 
property of a molecule, we used NMR spectroscopy 
in an effort to localize changes in the structure and / 
or dynamics that could account for this. One-dimen- 
sional (1D) imino proton spectra (10 °C, pH 6.0) for 
all five RNAs are shown in Fig. 7. What is imme- 
diately apparent for these RNAs is that the antici- 
pated noncanonical base pairings [i.e., G10eU34 in 
AA35/C34U RNA (Fig. 6), and U9®U36 in AA35/ 
A36U (Fig. 6)] are present, since the imino protons of 
U34 and U36 are strongly protected from exchange 
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Table 2. Thermodynamic parameters derived for the 
unfolding of WT*, SL1-AA35, SL1-AA35/U33C, SL1- 
AA35/C34U, and SL1-AA35/A36U RNAs 


Transition 1° Transition 2 


RNA AM, tmm1 AH» tno AG°37 AAG°37 
WT* 37.9 60.8 81.0 74.5 —11.5 3.0 
AA35 118.3 80.3 - - —14.5 - 
AA35/U33C 37.9 60.8 85.7 76.7 —124 2.1 
AA35/C34U 37.9 60.8 72.0 72.2 -—10.0 4.5 
AA35/A36U 37.9 60.8 69.8 72.2 —9.8 4.7 


Enthalpy and free energy are reported in kilocalories per mole, 
and tm is reported in degrees Celsius. Thermodynamic para- 
meters derived from the optical melting profiles determined using 
the two-state van’t Hoff unfolding model, with AC, =0.” 

* Parameters for transition 1 were fixed to the predicted values 
for the unfolding of the bottom four base pairs of SL1, with tran- 
sition 2 parameters optimized during the fit (see the text for de- 
tails). The AGs37; for the ith individual transition was obtained from 
AG°37= AH-—310.15 x AS, where AS= AH/tm. AG°37= YAGsz7', 
and AAG°s; is expressed relative to the AG°37 measured for the 
SL1-AA35 RNA. 


with solvent in each case.7>-?” Although we did not 
directly determine whether the A33U substitution 
resulted in a new AeC base pair, this is expected, 
since this substitution would give rise to exactly the 
same three noncanonical pairings that characterize 
SL1 of a related group 2 CoV HCoV-OC43; in that 
RNA, a protonated A +eC base pair is formed.” In 
addition, the finding that this RNA is the least 
destabilized relative to the AA35 mutant is not com- 
patible with the presence of extrahelical nucleotides 
in each of these positions. Thus, all three revertant 
RNAs are characterized by multiple noncanonical 
pairings in an otherwise perfectly base-paired heli- 
cal stem. 

Inspection of these spectra (Fig. 7), as well as a 
compilation of the imino proton solvent exchange 
rates (Fig. 8), reveals that thermodynamic destabi- 
lization manifests itself in multiple complex ways. 
First, the WT RNA imino proton spectrum reveals a 
superposition of at least two conformations in slow 
exchange on the lH NMR timescale, as evidenced by 
peak doubling of the U40, G6, and U38 imino reso- 
nances at the base of the stem. Exactly the same type 
of heterogeneity is observed in the AA35/U33C and 
AA35/C34U RNAs, but to a far greater degree— 
with the AA35/A36U RNA being more like the WT* 
RNA in this regard. All heterogeneity is lost in the 
AA35 parent RNA. 

The imino proton solvent exchange rates, kx, re- 
veal additional insight into SL1 dynamics. Although 
kx is a complex function of the rate constants for the 
opening (k,,) and closing (k.1) of the base pair (where 
the two-state equilibrium that is constant for base- 
pair opening, Ko,, is defined by ko, /k.), as well as of 
the intrinsic rate constant for proton exchange by 
base catalyst (k,,), the magnitude of k,, often tracks 
with the magnitude of k,, and Ko,,7°°° since ex- 
change will occur only very slowly from the base- 
paired state.°’** These data reveal that the WT* 
RNA is dynamically asymmetric, with imino pro- 
tons above the U13-U31 base pair characterized by 


slow kex (<2 s '), while those below this base pair 
are ~2- to ~50-fold faster. For the AA35 RNA, imino 
proton solvent exchange rates are globally quenched, 
but most strongly near the site of the deletion. For 
example, the k,, values of U9, G10, and U31 are 
strongly attenuated in the AA35 RNA by = 10-, =32-, 
and ~5-fold, respectively. Inspection of the base-pair 
dynamics for the three mutant SL1 RNAs reveals that 
each reintroduces or accentuates one or more specific 
features of the kinetic lability that characterizes the 
WT* RNA. For example, in all three RNAs, the imino 
proton solvent exchange rates of the U13eU31 base 
pair are elevated relative to the AA35 RNA by 4- to 
10-fold, with some more so than in the WT* RNA. 
This perturbation is local in the AA35/U33C RNA, 
but next nearest neighbor and long range in the 
AA35/C34U and AA35/A36U RNAs, respectively. 
The solvent exchange rate of G10 is also increased in 
all RNAs, most prominently in the AA35/C34U 
RNA in the context of the new G10eU34 base pair- 
ing, but also in the other two revertant RNAs (by 3- 
to 4-fold). These data argue that the base of SL1 
through the pyrimidine base-pairing region in both 
the WT* and the second-site mutant SL1s recovered 
from the genetically unstable SL1-AA35 virus must 
be conformationally heterogeneous and dynami- 
cally unstable; this, in turn, allows this region to 
become transiently unfolded, so that a long-range 
interaction with the 3’ UTR in MHV can occur. 


Discussion 


The molecular mechanisms by which CoVs carry 
out sgRNA synthesis and ultimately coordinate this 
process with replication of the gRNA and translation 
of the genome remain poorly understood. An early 
event in these processes, however, may well be 
genome circularization, which places the 5’ and 3’ 
termini of the viral genome in close physical prox- 
imity to facilitate template switching during sgRNA 
minus-strand synthesis. Template switching is sim- 
ply hybridization of the nascently synthesized minus 
strand corresponding to the intergenic or TRS-B 
sequences with the complementary leader TRS, a 
process expected to be strongly stimulated by the 
close physical proximity of the two complementary 
sequences. This process is almost certainly regulated 
by highly specific RNA structural motifs found in the 
5’ and 3’ UTRs and associated interactions with 
virally encoded and/or host proteins. The 5’ leader 
RNA appended to all sgRNAs corresponds to the 
extreme 5’ end of the 5’ UTR and is composed of two 
stem-loop structures termed SL1 and SL2 just up- 
stream of the TRS that defines the leader-body 
junction.'? Characterization of nonviable viruses 
that harbor mutations in SL1 (this work) or SL2”° 
reveals that these genomes are generally WT 
with respect to the synthesis of full-length gRNA 
and anti-gRNAs; however, all are absolutely im- 
paired in sgRNA synthesis. 

A recent model for CoV replication’® postulated 
that circularization of the genome is a necessary early 
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step in sgRNA synthesis. However, there is little 
direct evidence for or against a physical association 
of the 5’ and 3’ ends of the genome that might be 
required for sgRNA synthesis, as well as its nature 
(RNA mediated, protein mediated, or both), al- 
though it has been proposed that circularization 
mediated by a protein bridge composed of cap- 
binding protein, eIF4E, e[F4G, and poly(A) binding 
protein®? might be important for initiation of the 
replicase gene translation.** During the course of 
testing the functional impact of mutations deposited 
in the MHV SL1, we have uncovered two critical 
aspects of SL1 that are required for sgRNA synthesis. 
Here, we present the first genetic evidence in support 
of a direct interaction between SL1 and the extreme 
3’ end of the genome; furthermore, the physical 
analysis of RNAs harboring second-site revertants in 
SL1 recovered from the genetically unstable SL1- 
AA35 viruses allows us to pinpoint clear structural 
requirements for the lower region of SL1 (below the 
pyr-pyr base pairs mapped by NMR spectroscopy) 
that are essential for supporting viral replication. 

Our functional data indicate that the base pairing 
of the upper portion of the stem is required for 
optimal viral replication, while the lower portion is 
weakly base paired or even unpaired. Genomes 
containing A5G/U40C or A7G/U38C mutations 
that stabilize the extreme lower portion of the stem 
could not be recovered as viable viruses. While 
deletion of A35 is still viable, this virus is genetically 
unstable and gives rise to second-site mutations in 
both the 5’ UTR and the 3’ UTR. Interestingly, all the 
second-site mutations in the 5’ UTR are located 
below the pyr—pyr base pairs and introduce non- 
canonical base pairs to maintain stem formation. 
Each of these mutants of SL1 stem structures are less 
thermodynamically stable compared with the 
genetically unstable SL1-AA35 mutant, which 
forms a fully base-paired helical stem. These muta- 
tions also introduce conformational heterogeneity in 
this region as well. Interestingly, the MHV SL1-AA35 
revertant mutants bear some resemblance to the 
predicted secondary structure of a WT SL1 from a 
related group 2 CoV, HKU1.'**° HKU1 SL1 does not 
contain an extrahelical nucleotide, but instead is 
characterized by tandem-predicted AeC and GeA 
mismatches in precisely the same region where 5’ 
UTR SL1 second-site mutations map in MHV. 

In addition to the global thermodynamic stability 
information, the measurement of imino proton ex- 
change gives important insights into the differences 
in the dynamics or flexibility of individual base pairs 
in different AA35 RNAs. The k,,, results show that the 
lower half of the SL1 is kinetically labile, and the three 
second-site revertant mutants recover one or more 
dynamic characteristics of the WT* RNA relative to 
the AA35 mutant. Indeed, the closer that the site of 
mutation localizes to A35, the more similar the dyna- 
mic properties become relative to the WI* RNA. 
Therefore, a less stable and more flexible SL1 might 
facilitate a specific interaction between the single- 
stranded RNA in this region and a host-encoded or 
virally encoded protein(s) that is crucial for genome 


circularization and replication. A cartoon model that 
graphically illustrates this idea is shown in Fig. 9, 
where the structure, conformational ensemble, or 
longer-lived partially opened form(s) of SL1, sche- 
matized by Al’ relative to fully base-paired A 
structure, lowers the energy barrier for formation of 
a complex with hypothetical protein B, which in turn 
drives genome circularization and sgRNA synthesis. 


Genetic interaction between the 5’ UTR and the 
3’ UTR 


We note that four of five SL1 second-site muta- 
tions recovered from SL1-AA35 viruses (U33C, 
C34U, A36U, and C37U) map specifically to the 3’ 
strand of SL1, with the lone exception being G10A; 
all recovered viruses therefore maintain a base- 
paired 5’-"GAGYR” sequence in the 5’ portion of 
SL1 in MHV. This result suggests that the nucleotide 
sequence of the 5’ region of SL1 may also be im- 
portant for viability. While this motif is not abso- 
lutely essential when the base of SL1 is predicted to 
be unpaired (CGAG® to °CUC® in the SL1-C virus; 
Table 1), a lethal mutation results when the same 
mutation is introduced while maintaining base 


Al 


Gy 


genome 
( circularization 


AB 


Fig. 9. Model of a dynamic SL1 that is consistent with 
the functional and structural data presented here. The fully 
based-paired SL1 (A; modeled by the AA35 RNA) exists in 
equilibrium with one or more higher-energy conformers 
(A’; WT* and AA35 second-site revertants) that are par- 
tially unfolded or that experience dynamic destabilization 
as a result of noncanonical pairing. A hypothetical protein 
(B) binds to both A and A’ to form the same partially 
unwound AB complex, but the affinity of B for A’ will be 
higher, since the full energetic cost of unfolding the lower 
stem will not have to be paid; this interaction then mediates 
a long-distance RNA-RNA, RNA-protein, or protein— 
protein interaction, which is crucial for the viral replication. 
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pairing here (in the SL1-CD virus; Table 1) (ie., ina 
structural context most similar to that of WT MHYV). 
This short sequence motif is conserved and predicted 
to be base paired in all group 2 CoV genomes, with 
the exception of SARS-CoV, which has an 5’-AGGU 
sequence in what is predicted to be a region of SL1 
weakly paired with the A extrahelical.!? Strikingly, 
this SL1 5’-°GAG® sequence is identical with the 5’- 
GAG sequence recovered from second-site muta- 
tions codeposited in 3’ UTR in the same viruses; this 
suggests the possibility that an oligomer of the same 
protein(s) that is capable of recognizing and/or 
“melting out” this short purine-rich motif may well 
play an important role in mediating a physical 
interaction between the 5’ UTR and the 3’ UTR. An 
excellent candidate for this role is hnRNAP A1, since 
both 3’ UTR mutations occur in exactly the same 
sequence context (5’-GAAG, a near-consensus bind- 
ing site for A1), and each replaces the second A witha 
G (a consensus hnRNAP A1 binding site),°°°” with 
the 5’-AG as the key specificity determinant. hnRNAP 
A1 possesses RNA chaperone or helix destabilization 
activity and has been shown to bind the 3’ UTR in 
MHYV, while hnRNP Al and PTB both bind to the 
complementary strands at the 5’ end of MHV RNA. 
There is some evidence to suggest that these proteins 
together mediate the formation of an ribonucleopro- 
tein complex involving the 5’- and 3’-end fragments of 
MHV RNA in ditro.°® 

Our data are compatible with the three-step work- 
ing model of CoV transcription recently refined by 
Zuniga et al., in which the first step is the formation 
of a 5’-3’ UTR complex through protein-RNA and 
protein-protein interactions, by which the TRS-L 
would be located in close proximity to sequences 
located at the 3’ end of gRNA and, in turn, make 
template jumping from individual TRS-B sequences 
favored.'> Overall, we hypothesize that SL1 must 
possess an optimum stability or kinetic lability re- 
quired to mediate a key long-range physical interac- 
tion between the 5’ UTR and 3’ the UTR that is critical 
specifically for sg RNA synthesis, but not required for 
genomic minus-strand synthesis. This finding sug- 
gests that the structural requirements and/or the 
nature of the assembled replication complexes that 
are capable of synthesizing full-length gRNA is 
distinct from those that are capable of synthesizing 
sgRNAs. Such a functionally bipartite structure, 
where a subregion of a helical stem below tandem 
wobble pairs is tuned to an optimum stability, is 
reminiscent of stem I of the U2 snRNA; here, these 
features have been hypothesized to allow U2 to adopt 
a number of mutually exclusive folded conformations 
during spliceosome assembly and catalysis.*” 


Materials and Methods 


Virus and cells 


DBT cells were maintained at 37 °C and 5% CQO, in 
DMEM supplemented with 10% calf serum (HyClone, 
Logan, UT). L2 cells were maintained at 37 °C and 3% CO, 


in DMEM supplemented with 10% calf serum. Baby 
hamster kidney-21 cells expressing the MHV receptor 
(BHK-R cells) were grown in DMEM supplemented with 
10% calf serum, 3% tryptose phosphate broth, and G418 
(800 ng ml ') to select for cells expressing the MHV 
receptor. MHV-A59 1000 was used as a WT control virus 
for comparison with chimeric viruses. 


Assembly of a full-length MHV-A59 infectious 
construct 


The reverse genetic system for MHV-A59 has been 
described.*” cDNAs representing the entire MHV-A59 
genome with either the WT sequence or the MHV SL1 
mutant sequences were constructed by ligation of the A 
fragments to fragments B-G, transcribed with T7 RNA 
polymerase, and electroporated into cells as described 
previously.'””° Cultures were observed for up to 72 h for 
the development of cytopathic effect and harvested by 
freezing at —70 °C. Cultures that did not develop cyto- 
pathic effect were blind passed three times through DBT 
cells in a further attempt to recover infectious virus. At 
least three independent experiments, including at least 
one experiment in which electroporated cells were incu- 
bated at 34 °C and 40 °C, were performed before a mutant 
genome was considered nonviable. 


Plasmid constructions 


The plasmid carrying the MHV-A59 A clone (plasmid 
A)*° was used to introduce mutations into the MHV 5’ 
UTR. Briefly, a 0.55-kb BamHI-Mlul fragment of A 
plasmid was amplified and cloned into pGEM-T vector 
(Promega) to produce pWt5’ UTR-PG, which harbors a T7 
promoter, the 209-nt MHV-A59 5’ UTR, and 257 nt of the 
5’ Orfla coding sequence. Most mutations were intro- 
duced into pWt5’ UTR-PG with the Quick Change site- 
directed mutagenesis kit (Stratagene) in accordance with 
the manufacturer’s instructions and confirmed by sequen- 
cing. An oligonucleotide assembly strategy” was used to 
introduce the A5G, U40C, and A(C16,C19,C20,A35) muta- 
tions into pWt5’ UTR-PG. The Mlul-BamHI fragments 
containing the desired mutations were introduced into 
plasmid A by restriction fragment exchange. The region 
between the Mlul and the BamHI sites in the resulting 
cloned plasmids was sequenced to verify that the desired 
mutation was recovered. The sequences of the mutagenic 
oligonucleotides are shown in Supplementary Table 1. 


5' rapid amplification of cDNA ends 


Mutant viruses were subjected to one round of plaque 
purification and were expanded once in DBT cells. Total 
RNA (2 1g) was reverse transcribed using oligo 8 (Supple- 
mentary Table 1). Resultant RT products were purified 
using the QIAquick PCR Purification Kit (Qiagen), then 3’ 
tailing was conducted using Terminal Transferase (Roche) 
to add a poly(A) tail to the 3’ ends of the purified RT 
products. The poly(A)-tailed RT products were repurified 
using the QIAquick PCR Purification Kit. A first round of 
PCR was performed using 10 1l of the poly(A)-tailed RT 
products with the adapter primer (AP-dT17) and the 
MHV-1RV5 primer. A second round of PCR was per- 
formed with 1 11 of a 1:50 dilution of the previous reaction 
using the adapter primer and the SL1 check (—) primer 
(Supplementary Table 1). The PCR products were gel 
purified and sequenced using the SL1 check (—) primer. 
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RNA analysis by RT-PCR 


Replicate cultures of BHK-R cells were electroporated in 
parallel with either MHV SL1 mutant genomes, WT ge- 
nomes, or the nsp12-FS mutant (a construct containing a 
frameshift mutation in nsp12 rendering it incapable of 
directing virus-specific RNA synthesis*°) genome, and total 
RNAs were extracted at various times (hours) postel- 
ectroporation (hpe). To determine whether the input 
RNAs, plus any replicated genome RNA, were present in 
the electroporated cells, the extracted RNAs were primed for 
RT by oligo A59(—) 16596-16577 and amplified by PCR using 
oligo A59(+) 14639-14658 and oligo A59(—) 16596-16577. 
Resultant PCR products were further amplified by nested 
PCR using oligo A59(+) 16038-16059 and oligo A59(-) 
16596-16577. The RNA species present in cells were further 
characterized by RT-PCR as described previously.'?”° 
Parallel reactions in which reverse transcriptase was omitted 
from the cDNA step were always performed to ensure that 
the PCR assay did not detect residual DNA transcription 
templates that entered the cells during electroporation. 
Quantitative RT-PCR was conducted to determine whether 
nonviable MHV SL1 mutant genomes synthesize positive- 
stranded gRNA (+gRNA). Total RNAs were extracted at 
4 hpe, 8 hpe, and 12 hpe, and poly(A)* mRNAs were 
purified using the Oligotex mRNA Midi Kit (Qiagen). Poly 
(A)* mRNAs (180 ng) were used as template for the 
synthesis of genomic cDNA, glyderaldehyde-3-phosphate 
dehydrogenase (G3PDH) cDNA with primer A59(—) 16577- 
16596, and GAPDH-R2, respectively, using the SuperScript II 
RNase H-Reverse Transcription Kit (Invitrogen). Then, 
quantitative PCR was followed to amplify +gRNA with 
primer A59(+) 16038-16059 using iQ™ SYBR Green Super- 
mix (Bio-Rad). In parallel, GAPDH was also amplified with 
primers GAPDH-F2 and GAPDH-R2. After 40 cycles, the 
PCR products were displayed by gel electrophoresis. 
Quantitative PCRs with poly(A)* selected RNA demon- 
strated that samples were free of detectable DNA. 


Preparation of RNA samples for physical studies 


RNAs were obtained by in vitro runoff transcription 
using SP6 RNA polymerase and purified by denaturing 
PAGE essentially as previously described.*! The NMR 
samples were subjected to exhaustive dialysis into a final 
buffer of 10 mM potassium phosphate (pH 6.0) and 100 4M 
2,2-dimethyl-2-silapentane-5-sulfonate (DSS), with RNA 
concentrations ranging from ~1mM to ~2.5 mM in 300 pl 
(10% DO). The RNA samples for thermal denaturation 
experiments and calorimetry experiments were prepared 
by dilution into a final dialysis buffer of 10 mM potassium 
phosphate (pH 6.0), 100 mM KCI, and 5 mM MgCl. Before 
each experiment, the RNA samples were annealed by 
heating at 65 °C for 10 min, followed by slow cooling at 
room temperature. All samples were =>90% monomeric, as 
judged by nondenaturing PAGE. 


Thermal denaturation experiments 


RNA melts were collected on a Cary 1 scanning spec- 
trophotometer operating in double-beam mode. The RNA 
concentrations were between 1 4M and 20 uM, and all 
melting profiles were shown to be independent of RNA 
concentration over this range. The first derivative data of 
absorbance at 260 nm and 280 nm with respect to 
temperature (dA/dT) were subjected to a simultaneous 
nonlinear least-squares fit of AHi, tm,i, and Aj, for each ith 


unfolding transition via sequential interacting two-state 
unfolding transition model using the t-melt program 
running on a Silicon Graphics O2 workstation as pre- 
viously described.”* Melting profiles were subjected to 
single- or two-transition unfolding models, as described in 
the text. Parallel thermal melts (50-100 uM RNA strand) 
were carried out on a Microcal VP-DSC scanning calo- 
rimeter under the same solution conditions as previously 
described.” Analysis of these baseline-corrected data with 
single-transition two-state unfolding model gave fitted 
parameters in qualitative agreement with those obtained 
from analysis of the optically monitored thermal unfolding 
experiments (data not shown). 


Saturation transfer solvent exchange experiments 


NMR spectra were acquired on a Varian Inova 500-MHz 
spectrometer at the Biomolecular NMR Laboratory, Texas 
A&M University. The data were processed using 
NMRPipe*’ and analyzed using SPARKY (SPARKY 3; 
T.D. Goddard and D.G. Kneller, University of California, 
San Francisco)§. The imino proton resonances were 
assigned by the jump-return echo 1D and Watergate 
homonuclear 'H-'H nuclear Overhauser enhancement 
spectroscopy spectra (Tmix=300 ms), with reference to an 
internal standard DSS at 10 °C. The imino proton exchange 
rates were obtained by transfer of magnetization from 
water." The exchange was initiated by selectively inverting 
the water proton resonance using a Gaussian 180° pulse 
(5.3 ms duration), with exchange delay times ranging from 
2 ms to 850 ms, followed by a weak gradient (0.1 G cm’) 
applied during the exchange delay to minimize effects due 
to radiation damping. At the end of the exchange delay, a 
second Gaussian pulse (2.8 ms) was applied to bring the 
water magnetization back to the z-axis. Imino proton 
resonances were detected using a gradient-enhanced spin- 
echo sequence. The acquired spectral array was processed 
using NUTS (Acorn NMR, Inc.), and the intensities of the 
imino proton resonances of interest were fitted to obtain the 
imino proton solvent exchange rate k,. using Kaleidagraph 
(Synergy Software) as previously described.**” 
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